Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 1299 |
| Missing cells | 355 |
| Missing cells (%) | 1.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 192.9 KiB |
| Average record size in memory | 152.1 B |
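The derived figures in this table can be re-computed from the raw counts as a consistency check. The following is a minimal sketch in plain Python; the variable names are illustrative, and the only inputs are numbers reported in this table and in the missing-value warnings of this report.

```python
# Figures copied from the report; the variable names are illustrative.
n_variables = 19
n_observations = 1299
missing_per_column = {"current_zones": 100, "zone": 100, "century_zone": 155}
total_size_kib = 192.9

# A "cell" is one value of one variable in one observation.
total_cells = n_variables * n_observations
missing_cells = sum(missing_per_column.values())
missing_pct = round(100 * missing_cells / total_cells, 1)

# 1 KiB = 1024 bytes.
avg_record_bytes = round(total_size_kib * 1024 / n_observations, 1)

print(missing_cells, missing_pct, avg_record_bytes)  # 355 1.4 152.1
```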
Variable types
| CAT | 9 |
|---|---|
| NUM | 9 |
| BOOL | 1 |
Warnings
| Warning | Type |
|---|---|
| property_type has constant value "Apartment" | Constant |
| country has constant value "Albania" | Constant |
| division has constant value "Tirana" | Constant |
| city has constant value "Tirana" | Constant |
| year_of_renovation has constant value "0" | Constant |
| current_zones has a high cardinality: 175 distinct values | High cardinality |
| zone has a high cardinality: 76 distinct values | High cardinality |
| century_zone has a high cardinality: 67 distinct values | High cardinality |
| century_zone is highly correlated with zone | High correlation |
| zone is highly correlated with century_zone | High correlation |
| current_zones has 100 (7.7%) missing values | Missing |
| zone has 100 (7.7%) missing values | Missing |
| century_zone has 155 (11.9%) missing values | Missing |
| df_index has unique values | Unique |
| propertiesid has unique values | Unique |
| price has 21 (1.6%) zeros | Zeros |
| interior_area has 129 (9.9%) zeros | Zeros |
| gros_area has 62 (4.8%) zeros | Zeros |
| bedrooms has 84 (6.5%) zeros | Zeros |
| bathrooms has 77 (5.9%) zeros | Zeros |
| other_rooms has 1068 (82.2%) zeros | Zeros |
| year_of_construction has 1266 (97.5%) zeros | Zeros |
Reproduction
| Analysis started | 2021-05-25 17:06:10.481583 |
|---|---|
| Analysis finished | 2021-05-25 17:06:23.436117 |
| Duration | 12.95 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
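The reported duration follows directly from the two timestamps. A minimal sketch of that arithmetic, using only the values shown in the table above:

```python
from datetime import datetime

# Timestamps copied verbatim from the Reproduction table.
started = datetime.fromisoformat("2021-05-25 17:06:10.481583")
finished = datetime.fromisoformat("2021-05-25 17:06:23.436117")

duration_s = (finished - started).total_seconds()
print(round(duration_s, 2))  # 12.95
```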
df_index
Numeric
| Distinct | 1299 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 661.7528868 |
|---|---|
| Minimum | 0 |
| Maximum | 1321 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Memory size | 10.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 65.9 |
| Q1 | 329.5 |
| median | 662 |
| Q3 | 995.5 |
| 95-th percentile | 1256.1 |
| Maximum | 1321 |
| Range | 1321 |
| Interquartile range (IQR) | 666 |
Descriptive statistics
| Standard deviation | 383.0182217 |
|---|---|
| Coefficient of variation (CV) | 0.5787934277 |
| Kurtosis | -1.207526282 |
| Mean | 661.7528868 |
| Median Absolute Deviation (MAD) | 333 |
| Skewness | -0.002293214572 |
| Sum | 859617 |
| Variance | 146702.9581 |
| Monotonicity | Strictly increasing |
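Several of the statistics above are simple functions of one another. The sketch below re-derives them from the reported values for this variable; it is a consistency check under the assumption that the report uses the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1), and the variable names are illustrative.

```python
# Values copied from the tables above for this variable.
n_observations = 1299
total = 859617
mean = 661.7528868
std = 383.0182217
q1, q3 = 329.5, 995.5
minimum, maximum = 0, 1321

mean_from_sum = total / n_observations  # should match the reported mean
cv = std / mean                         # coefficient of variation
variance = std ** 2
iqr = q3 - q1
value_range = maximum - minimum

print(f"mean={mean_from_sum:.4f} cv={cv:.6f} variance={variance:.2f}")
print(f"iqr={iqr} range={value_range}")
```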
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1321 | 1 | 0.1% | |
| 452 | 1 | 0.1% | |
| 433 | 1 | 0.1% | |
| 434 | 1 | 0.1% | |
| 435 | 1 | 0.1% | |
| 436 | 1 | 0.1% | |
| 437 | 1 | 0.1% | |
| 438 | 1 | 0.1% | |
| 439 | 1 | 0.1% | |
| 440 | 1 | 0.1% | |
| Other values (1289) | 1289 | 99.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1 | 0.1% | |
| 1 | 1 | 0.1% | |
| 2 | 1 | 0.1% | |
| 3 | 1 | 0.1% | |
| 4 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1321 | 1 | 0.1% | |
| 1320 | 1 | 0.1% | |
| 1319 | 1 | 0.1% | |
| 1318 | 1 | 0.1% | |
| 1317 | 1 | 0.1% |
propertiesid
Numeric
| Distinct | 1299 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6519.856043 |
|---|---|
| Minimum | 2645 |
| Maximum | 17239 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 10.1 KiB |
Quantile statistics
| Minimum | 2645 |
|---|---|
| 5-th percentile | 2825.8 |
| Q1 | 3487 |
| median | 4499 |
| Q3 | 9615 |
| 95-th percentile | 16135.2 |
| Maximum | 17239 |
| Range | 14594 |
| Interquartile range (IQR) | 6128 |
Descriptive statistics
| Standard deviation | 4214.956015 |
|---|---|
| Coefficient of variation (CV) | 0.6464799202 |
| Kurtosis | 0.2293260881 |
| Mean | 6519.856043 |
| Median Absolute Deviation (MAD) | 1288 |
| Skewness | 1.209140774 |
| Sum | 8469293 |
| Variance | 17765854.2 |
| Monotonicity | Strictly decreasing |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 4095 | 1 | 0.1% | |
| 10915 | 1 | 0.1% | |
| 4745 | 1 | 0.1% | |
| 2698 | 1 | 0.1% | |
| 2941 | 1 | 0.1% | |
| 14989 | 1 | 0.1% | |
| 2809 | 1 | 0.1% | |
| 2933 | 1 | 0.1% | |
| 4754 | 1 | 0.1% | |
| 2711 | 1 | 0.1% | |
| Other values (1289) | 1289 | 99.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2645 | 1 | 0.1% | |
| 2648 | 1 | 0.1% | |
| 2649 | 1 | 0.1% | |
| 2650 | 1 | 0.1% | |
| 2651 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 17239 | 1 | 0.1% | |
| 17224 | 1 | 0.1% | |
| 17215 | 1 | 0.1% | |
| 17206 | 1 | 0.1% | |
| 17204 | 1 | 0.1% |
property_type
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.1 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Apartment | 1299 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
property_status
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.1 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Used | 754 | 58.0% | |
| New | 431 | 33.2% | |
| Under Construction | 62 | 4.8% | |
| Under construction | 46 | 3.5% | |
| Not Applicable | 5 | 0.4% | |
| For refurbishment | 1 | 0.1% |
Frequencies of value counts
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.1% |
Histogram of lengths of the category
Length
| Max length | 18 |
|---|---|
| Median length | 4 |
| Mean length | 4.880677444 |
| Min length | 3 |
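The value counts above list "Under Construction" and "Under construction" as separate categories, which differ only in capitalization. The sketch below shows one possible way to merge such spelling variants before analysis; the `normalize` helper is hypothetical and not part of the profiling tool, and whether the two labels really mean the same thing is an assumption to confirm against the data source.

```python
from collections import Counter

# Value counts copied from the property_status table above.
counts = Counter({
    "Used": 754,
    "New": 431,
    "Under Construction": 62,
    "Under construction": 46,
    "Not Applicable": 5,
    "For refurbishment": 1,
})

def normalize(label: str) -> str:
    """Collapse repeated whitespace and ignore case (hypothetical helper)."""
    return " ".join(label.split()).casefold()

# Re-aggregate the counts under the normalized labels.
merged = Counter()
for label, count in counts.items():
    merged[normalize(label)] += count

print(len(counts), len(merged))      # 6 5
print(merged["under construction"])  # 108
```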
availability
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.1 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Withdrawn | 765 | 58.9% | |
| Available | 392 | 30.2% | |
| Sold | 133 | 10.2% | |
| Reserved | 6 | 0.5% | |
| Rented | 1 | 0.1% | |
| Potential | 1 | 0.1% | |
| In negotiation | 1 | 0.1% |
Frequencies of value counts
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 0.2% |
Histogram of lengths of the category
Length
| Max length | 14 |
|---|---|
| Median length | 9 |
| Mean length | 8.484988453 |
| Min length | 4 |
country
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.1 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Albania | 1299 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
division
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.1 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Tirana | 1299 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
city
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.1 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Tirana | 1299 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
current_zones
Categorical
| Distinct | 175 |
|---|---|
| Distinct (%) | 14.6% |
| Missing | 100 |
| Missing (%) | 7.7% |
| Memory size | 10.1 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Don Bosko | 102 | 7.9% | |
| Rruga e Kavajes | 63 | 4.8% | |
| 21 dhjetori | 55 | 4.2% | |
| Komuna e Parisit | 50 | 3.8% | |
| Ali Demi | 47 | 3.6% | |
| Fresku | 43 | 3.3% | |
| Kodra e Diellit | 40 | 3.1% | |
| Liqeni i Thate | 34 | 2.6% | |
| Astiri | 32 | 2.5% | |
| Fusha e Aviacionit | 28 | 2.2% | |
| Other values (165) | 705 | 54.3% | |
| (Missing) | 100 | 7.7% |
Frequencies of value counts
Unique
| Unique | 90 |
|---|---|
| Unique (%) | 7.5% |
Histogram of lengths of the category
Length
| Max length | 90 |
|---|---|
| Median length | 11 |
| Mean length | 13.852194 |
| Min length | 3 |
zone
Categorical
| Distinct | 76 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 100 |
| Missing (%) | 7.7% |
| Memory size | 10.1 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Don Bosko | 108 | 8.3% | |
| 21 dhjetori | 68 | 5.2% | |
| Rruga e Kavajes | 67 | 5.2% | |
| Komuna e Parisit | 54 | 4.2% | |
| Ali Demi | 53 | 4.1% | |
| Laprake | 46 | 3.5% | |
| Fresku | 43 | 3.3% | |
| Kodra e Diellit | 42 | 3.2% | |
| Astiri | 42 | 3.2% | |
| Unaza e re | 41 | 3.2% | |
| Other values (66) | 635 | 48.9% | |
| (Missing) | 100 | 7.7% |
Frequencies of value counts
Unique
| Unique | 10 |
|---|---|
| Unique (%) | 0.8% |
Histogram of lengths of the category
Length
| Max length | 25 |
|---|---|
| Median length | 10 |
| Mean length | 10.85450346 |
| Min length | 3 |
century_zone
Categorical
| Distinct | 67 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 155 |
| Missing (%) | 11.9% |
| Memory size | 10.1 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Don Bosco | 108 | 8.3% | |
| 21 Dhjetori | 68 | 5.2% | |
| Rruga e Kavajes | 67 | 5.2% | |
| Komuna e Parisit | 54 | 4.2% | |
| Ali Demi | 53 | 4.1% | |
| Laprakë | 46 | 3.5% | |
| Fresku | 43 | 3.3% | |
| Unaza e Re | 42 | 3.2% | |
| Astiri | 42 | 3.2% | |
| Liqeni i Thatë | 34 | 2.6% | |
| Other values (57) | 587 | 45.2% | |
| (Missing) | 155 | 11.9% |
Frequencies of value counts
Unique
| Unique | 6 |
|---|---|
| Unique (%) | 0.5% |
Histogram of lengths of the category
Length
| Max length | 27 |
|---|---|
| Median length | 9 |
| Mean length | 10.74210931 |
| Min length | 3 |
price
Numeric
| Distinct | 383 |
|---|---|
| Distinct (%) | 29.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 93793.2679 |
|---|---|
| Minimum | 0 |
| Maximum | 1100000 |
| Zeros | 21 |
| Zeros (%) | 1.6% |
| Memory size | 10.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 42000 |
| Q1 | 60000 |
| median | 80000 |
| Q3 | 110000 |
| 95-th percentile | 192300 |
| Maximum | 1100000 |
| Range | 1100000 |
| Interquartile range (IQR) | 50000 |
Descriptive statistics
| Standard deviation | 60728.80667 |
|---|---|
| Coefficient of variation (CV) | 0.6474751124 |
| Kurtosis | 62.94105857 |
| Mean | 93793.2679 |
| Median Absolute Deviation (MAD) | 23000 |
| Skewness | 5.142901065 |
| Sum | 121837455 |
| Variance | 3687987960 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 85000 | 39 | 3.0% | |
| 55000 | 33 | 2.5% | |
| 75000 | 33 | 2.5% | |
| 65000 | 30 | 2.3% | |
| 80000 | 27 | 2.1% | |
| 60000 | 27 | 2.1% | |
| 95000 | 26 | 2.0% | |
| 105000 | 22 | 1.7% | |
| 0 | 21 | 1.6% | |
| 70000 | 19 | 1.5% | |
| Other values (373) | 1022 | 78.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 21 | 1.6% | |
| 102 | 1 | 0.1% | |
| 1200 | 2 | 0.2% | |
| 1500 | 1 | 0.1% | |
| 6000 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1100000 | 1 | 0.1% | |
| 530000 | 1 | 0.1% | |
| 420000 | 1 | 0.1% | |
| 380000 | 1 | 0.1% | |
| 379000 | 1 | 0.1% |
interior_area
Numeric
| Distinct | 147 |
|---|---|
| Distinct (%) | 11.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84.22324865 |
|---|---|
| Minimum | 0 |
| Maximum | 775 |
| Zeros | 129 |
| Zeros (%) | 9.9% |
| Memory size | 10.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 64 |
| median | 88 |
| Q3 | 106 |
| 95-th percentile | 140 |
| Maximum | 775 |
| Range | 775 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 43.99840011 |
|---|---|
| Coefficient of variation (CV) | 0.522402078 |
| Kurtosis | 47.31948931 |
| Mean | 84.22324865 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 2.936388499 |
| Sum | 109406 |
| Variance | 1935.859213 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 129 | 9.9% | |
| 90 | 38 | 2.9% | |
| 100 | 34 | 2.6% | |
| 95 | 26 | 2.0% | |
| 92 | 25 | 1.9% | |
| 80 | 22 | 1.7% | |
| 110 | 22 | 1.7% | |
| 94 | 21 | 1.6% | |
| 93 | 20 | 1.5% | |
| 102 | 20 | 1.5% | |
| Other values (137) | 942 | 72.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 129 | 9.9% | |
| 20 | 1 | 0.1% | |
| 28 | 2 | 0.2% | |
| 32 | 1 | 0.1% | |
| 33 | 2 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 775 | 1 | 0.1% | |
| 290 | 1 | 0.1% | |
| 270 | 1 | 0.1% | |
| 258 | 1 | 0.1% | |
| 247 | 1 | 0.1% |
gros_area
Numeric
| Distinct | 150 |
|---|---|
| Distinct (%) | 11.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 94.67590454 |
|---|---|
| Minimum | 0 |
| Maximum | 393 |
| Zeros | 62 |
| Zeros (%) | 4.8% |
| Memory size | 10.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 31 |
| Q1 | 73 |
| median | 98 |
| Q3 | 114 |
| 95-th percentile | 150 |
| Maximum | 393 |
| Range | 393 |
| Interquartile range (IQR) | 41 |
Descriptive statistics
| Standard deviation | 38.33298832 |
|---|---|
| Coefficient of variation (CV) | 0.4048864229 |
| Kurtosis | 5.617280093 |
| Mean | 94.67590454 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 0.6164140847 |
| Sum | 122984 |
| Variance | 1469.417994 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 62 | 4.8% | |
| 100 | 53 | 4.1% | |
| 110 | 28 | 2.2% | |
| 120 | 27 | 2.1% | |
| 105 | 27 | 2.1% | |
| 104 | 26 | 2.0% | |
| 90 | 26 | 2.0% | |
| 115 | 25 | 1.9% | |
| 70 | 24 | 1.8% | |
| 65 | 23 | 1.8% | |
| Other values (140) | 978 | 75.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 62 | 4.8% | |
| 20 | 1 | 0.1% | |
| 28 | 1 | 0.1% | |
| 31 | 2 | 0.2% | |
| 33 | 2 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 393 | 1 | 0.1% | |
| 298 | 1 | 0.1% | |
| 290 | 1 | 0.1% | |
| 270 | 1 | 0.1% | |
| 250 | 3 | 0.2% |
bedrooms
Numeric
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.82986913 |
|---|---|
| Minimum | 0 |
| Maximum | 8 |
| Zeros | 84 |
| Zeros (%) | 6.5% |
| Memory size | 10.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8624154802 |
|---|---|
| Coefficient of variation (CV) | 0.471298994 |
| Kurtosis | 2.701188702 |
| Mean | 1.82986913 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.3490100705 |
| Sum | 2377 |
| Variance | 0.7437604605 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2 | 708 | 54.5% | |
| 1 | 301 | 23.2% | |
| 3 | 175 | 13.5% | |
| 0 | 84 | 6.5% | |
| 4 | 23 | 1.8% | |
| 5 | 7 | 0.5% | |
| 8 | 1 | 0.1% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 84 | 6.5% | |
| 1 | 301 | 23.2% | |
| 2 | 708 | 54.5% | |
| 3 | 175 | 13.5% | |
| 4 | 23 | 1.8% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 8 | 1 | 0.1% | |
| 5 | 7 | 0.5% | |
| 4 | 23 | 1.8% | |
| 3 | 175 | 13.5% | |
| 2 | 708 | 54.5% |
bathrooms
Numeric
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.344110855 |
|---|---|
| Minimum | 0 |
| Maximum | 4 |
| Zeros | 77 |
| Zeros (%) | 5.9% |
| Memory size | 10.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 4 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.6037775588 |
|---|---|
| Coefficient of variation (CV) | 0.4492022044 |
| Kurtosis | -0.05604529003 |
| Mean | 1.344110855 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.01546653625 |
| Sum | 1746 |
| Variance | 0.3645473406 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=5)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 709 | 54.6% | |
| 2 | 504 | 38.8% | |
| 0 | 77 | 5.9% | |
| 3 | 7 | 0.5% | |
| 4 | 2 | 0.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 77 | 5.9% | |
| 1 | 709 | 54.6% | |
| 2 | 504 | 38.8% | |
| 3 | 7 | 0.5% | |
| 4 | 2 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 4 | 2 | 0.2% | |
| 3 | 7 | 0.5% | |
| 2 | 504 | 38.8% | |
| 1 | 709 | 54.6% | |
| 0 | 77 | 5.9% |
other_rooms
Numeric
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3833718245 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 1068 |
| Zeros (%) | 82.2% |
| Memory size | 10.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.9301824926 |
|---|---|
| Coefficient of variation (CV) | 2.426319393 |
| Kurtosis | 6.810895922 |
| Mean | 0.3833718245 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.620299753 |
| Sum | 498 |
| Variance | 0.8652394695 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1068 | 82.2% | |
| 2 | 88 | 6.8% | |
| 1 | 67 | 5.2% | |
| 3 | 58 | 4.5% | |
| 4 | 11 | 0.8% | |
| 5 | 5 | 0.4% | |
| 6 | 2 | 0.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1068 | 82.2% | |
| 1 | 67 | 5.2% | |
| 2 | 88 | 6.8% | |
| 3 | 58 | 4.5% | |
| 4 | 11 | 0.8% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 6 | 2 | 0.2% | |
| 5 | 5 | 0.4% | |
| 4 | 11 | 0.8% | |
| 3 | 58 | 4.5% | |
| 2 | 88 | 6.8% |
year_of_construction
Numeric
| Distinct | 14 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51.21170131 |
|---|---|
| Minimum | 0 |
| Maximum | 2022 |
| Zeros | 1266 |
| Zeros (%) | 97.5% |
| Memory size | 10.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2022 |
| Range | 2022 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 317.3217962 |
|---|---|
| Coefficient of variation (CV) | 6.196275227 |
| Kurtosis | 34.52963336 |
| Mean | 51.21170131 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.039520884 |
| Sum | 66524 |
| Variance | 100693.1223 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=14)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1266 | 97.5% | |
| 2020 | 12 | 0.9% | |
| 2021 | 9 | 0.7% | |
| 2014 | 2 | 0.2% | |
| 2022 | 1 | 0.1% | |
| 2019 | 1 | 0.1% | |
| 2013 | 1 | 0.1% | |
| 2010 | 1 | 0.1% | |
| 2008 | 1 | 0.1% | |
| 2006 | 1 | 0.1% | |
| Other values (4) | 4 | 0.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1266 | 97.5% | |
| 1990 | 1 | 0.1% | |
| 1998 | 1 | 0.1% | |
| 2000 | 1 | 0.1% | |
| 2001 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2022 | 1 | 0.1% | |
| 2021 | 9 | 0.7% | |
| 2020 | 12 | 0.9% | |
| 2019 | 1 | 0.1% | |
| 2014 | 2 | 0.2% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for a perfectly linear relationship the steepness of the line does not affect r; only the sign of the slope does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
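As an illustration of the definition above, here is a minimal plain-Python sketch of r. It is not the profiling tool's implementation; the function name `pearson_r` is an assumption, and the sample values are the `gros_area` and `price` columns from the first ten rows listed in this report.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/(n-1) factors of the covariance and of the two standard
    # deviations cancel, so plain sums of products are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / math.sqrt(ss_x * ss_y)

# gros_area and price from the first ten rows of the dataset.
gros_area = [77, 109, 112, 98, 63, 100, 60, 87, 155, 137]
price = [56000, 140000, 147000, 89000, 55000, 140000, 47400, 110000, 200000, 207000]

r = pearson_r(gros_area, price)
# Shifting and positively rescaling one variable leaves r unchanged.
r_rescaled = pearson_r([2 * a + 3 for a in gros_area], price)
print(round(r, 3), round(r_rescaled, 3))
```

Ten rows are far too few to say anything about the full dataset; the sample only demonstrates the calculation and the invariance property.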
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic relationships than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
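The definition above amounts to applying Pearson's formula to ranks. A minimal plain-Python sketch follows; the function names are assumptions, and giving tied values the average of their positions is a common convention that the text above does not specify.

```python
import math

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Pearson correlation computed on the rank variables of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    ss_x = sum((a - mean_x) ** 2 for a in rx)
    ss_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / math.sqrt(ss_x * ss_y)

# A nonlinear but strictly increasing relationship is perfectly monotonic.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y))
```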
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
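The pair-counting rule above can be sketched directly in plain Python. This is the simple form exactly as described (often called tau-a), in which tied pairs count as neither concordant nor discordant; statistical libraries commonly default to a tie-adjusted variant, so results on data with many ties may differ. The function name is an assumption.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant - discordant) / total number of pairs, as described above."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:    # both variables move in the same direction
            concordant += 1
        elif direction < 0:  # the variables move in opposite directions
            discordant += 1
        # direction == 0 means a tie in x or y: counted as neither
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

print(kendall_tau([1, 2, 3, 4], [10, 20, 30, 40]))  # 1.0
print(kendall_tau([1, 2, 3, 4], [40, 30, 20, 10]))  # -1.0
```

The nested pair loop is quadratic in the number of observations, which is fine for a dataset of this size (1299 rows) but slow for much larger ones.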
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available in the phik package documentation.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | df_index | propertiesid | property_type | property_status | availability | country | division | city | current_zones | zone | century_zone | price | interior_area | gros_area | bedrooms | bathrooms | other_rooms | year_of_construction | year_of_renovation |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 17239 | Apartment | Used | Available | Albania | Tirana | Tirana | Don Bosko | Don Bosko | Don Bosco | 56000 | 68 | 77 | 1 | 1 | 0 | 0 | 0 |
| 1 | 1 | 17224 | Apartment | Under Construction | Available | Albania | Tirana | Tirana | Liqeni i Thate | Liqeni i Thate | Liqeni i Thatë | 140000 | 94 | 109 | 2 | 2 | 0 | 0 | 0 |
| 2 | 2 | 17215 | Apartment | Used | Available | Albania | Tirana | Tirana | 9 kateshet | 9 kateshet | 9 Katëshet | 147000 | 112 | 112 | 2 | 2 | 0 | 0 | 0 |
| 3 | 3 | 17206 | Apartment | Used | Available | Albania | Tirana | Tirana | Oxhaku \|##\| Xhamlliku | Oxhaku | Oxhaku | 89000 | 0 | 98 | 0 | 0 | 0 | 0 | 0 |
| 4 | 4 | 17204 | Apartment | Used | Available | Albania | Tirana | Tirana | 21 dhjetori | 21 dhjetori | 21 Dhjetori | 55000 | 63 | 63 | 1 | 1 | 0 | 0 | 0 |
| 5 | 5 | 17195 | Apartment | Used | Available | Albania | Tirana | Tirana | Selvia | Selvia | Selvia | 140000 | 0 | 100 | 2 | 2 | 0 | 0 | 0 |
| 6 | 6 | 17168 | Apartment | Used | Available | Albania | Tirana | Tirana | Laprake | Laprake | Laprakë | 47400 | 60 | 60 | 1 | 1 | 0 | 0 | 0 |
| 7 | 7 | 17166 | Apartment | New | Available | Albania | Tirana | Tirana | 21 dhjetori | 21 dhjetori | 21 Dhjetori | 110000 | 79 | 87 | 2 | 1 | 0 | 0 | 0 |
| 8 | 8 | 17154 | Apartment | Used | Available | Albania | Tirana | Tirana | Blloku | Blloku | Blloku | 200000 | 141 | 155 | 3 | 2 | 0 | 0 | 0 |
| 9 | 9 | 17148 | Apartment | Used | Available | Albania | Tirana | Tirana | Blloku | Blloku | Blloku | 207000 | 128 | 137 | 3 | 2 | 0 | 2006 | 0 |
Last rows
| | df_index | propertiesid | property_type | property_status | availability | country | division | city | current_zones | zone | century_zone | price | interior_area | gros_area | bedrooms | bathrooms | other_rooms | year_of_construction | year_of_renovation |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1289 | 1312 | 2660 | Apartment | New | Sold | Albania | Tirana | Tirana | Unaza e re | Unaza e re | Unaza e Re | 55000 | 96 | 110 | 3 | 2 | 2 | 0 | 0 |
| 1290 | 1313 | 2659 | Apartment | Used | Withdrawn | Albania | Tirana | Tirana | Liqeni I Tiranes \|##\| Liqeni i Thate | Liqeni I Tiranes | Liqeni i Tiranës | 94000 | 99 | 99 | 3 | 1 | 2 | 0 | 0 |
| 1291 | 1314 | 2656 | Apartment | New | Sold | Albania | Tirana | Tirana | Don Bosko | Don Bosko | Don Bosco | 98000 | 108 | 118 | 2 | 2 | 1 | 0 | 0 |
| 1292 | 1315 | 2654 | Apartment | Used | Available | Albania | Tirana | Tirana | Ali Demi | Ali Demi | Ali Demi | 65000 | 120 | 0 | 4 | 0 | 3 | 0 | 0 |
| 1293 | 1316 | 2652 | Apartment | Used | Sold | Albania | Tirana | Tirana | Don Bosko | Don Bosko | Don Bosco | 55000 | 72 | 80 | 3 | 1 | 2 | 0 | 0 |
| 1294 | 1317 | 2651 | Apartment | Used | Sold | Albania | Tirana | Tirana | Blv. Zogu i Pare | Blv. Zogu i Pare | Blv. Zogu i Pare | 74000 | 74 | 80 | 3 | 2 | 2 | 0 | 0 |
| 1295 | 1318 | 2650 | Apartment | Used | Sold | Albania | Tirana | Tirana | Don Bosko | Don Bosko | Don Bosco | 84000 | 103 | 111 | 3 | 2 | 2 | 0 | 0 |
| 1296 | 1319 | 2649 | Apartment | Used | Sold | Albania | Tirana | Tirana | Tirana e Re | Tirana e Re | Tirana e Re | 198000 | 138 | 0 | 4 | 2 | 3 | 0 | 0 |
| 1297 | 1320 | 2648 | Apartment | Used | Withdrawn | Albania | Tirana | Tirana | Tirana e Re | Tirana e Re | Tirana e Re | 95000 | 105 | 115 | 3 | 2 | 2 | 0 | 0 |
| 1298 | 1321 | 2645 | Apartment | New | Sold | Albania | Tirana | Tirana | Kashar | Kashar | Kashar | 22000 | 48 | 0 | 1 | 1 | 0 | 0 | 0 |